Protein Structure Initiative

The Protein Structure Initiative (PSI) was a United States-based project that aimed to accelerate discovery in structural genomics and contribute to the understanding of biological function. Funded by the U.S. National Institute of General Medical Sciences (NIGMS) between 2000 and 2015, it sought to reduce the cost and time required to determine three-dimensional protein structures and to develop techniques for solving challenging problems in structural biology, including membrane proteins. Over a dozen research centers were supported by the PSI for work in building and maintaining high-throughput structural genomics pipelines, developing computational protein structure prediction methods, organizing and disseminating information generated by the PSI, and applying high-throughput structure determination to study a broad range of important biological and biomedical problems.

The project was organized into three separate phases. The first phase (PSI-1) spanned from 2000 to 2005 and was dedicated to demonstrating the feasibility of high-throughput structure determination, solving unique protein structures, and preparing for a subsequent production phase. The second phase, PSI-2, focused on implementing the high-throughput structure determination methods developed in PSI-1, as well as homology modeling and addressing bottlenecks such as modeling membrane proteins. The third phase, PSI:Biology, began in 2010 and consisted of networks of investigators applying high-throughput structure determination to study a broad range of biological and biomedical problems. The PSI program ended on July 1, 2015, although some of the PSI centers continue structure determination with support from other funding mechanisms.


Phase 1

The first phase of the Protein Structure Initiative (PSI-1) lasted from June 2000 until September 2005 and had a budget of $270 million, funded primarily by NIGMS with support from the National Institute of Allergy and Infectious Diseases. PSI-1 saw the establishment of nine pilot centers focusing on structural genomics studies of a range of organisms, including ''Arabidopsis thaliana'', ''Caenorhabditis elegans'' and ''Mycobacterium tuberculosis''. During this five-year period over 1,100 protein structures were determined, over 700 of which were classified as "unique" because they had less than 30% sequence similarity with other known protein structures.

The primary goal of PSI-1, to develop methods to streamline the structure determination process, resulted in an array of technical advances. Several methods developed during PSI-1 enhanced expression of recombinant proteins in systems like ''Escherichia coli'', ''Pichia pastoris'' and insect cell lines. New streamlined approaches to cell cloning, expression and protein purification were also introduced, in which robotics and software platforms were integrated into the protein production pipeline to minimize required manpower, increase speed, and lower costs.
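
The "unique" classification described for PSI-1 is, in essence, a threshold test on pairwise sequence comparison. As a rough illustration only, the Python sketch below assumes the sequences have already been aligned by an external tool and uses plain percent identity as a stand-in for the similarity measure. The function names, the gap handling, and the use of identity rather than a scored similarity are all assumptions made for this example; they do not describe the pipeline the PSI centers actually used.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned, non-gap columns whose residues are identical.

    Both inputs are assumed to be rows of the same pairwise alignment,
    with '-' marking gaps (a simplifying assumption for this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap.
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b)
             if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def is_unique(alignments, threshold: float = 30.0) -> bool:
    """True if every (candidate, known) alignment is below the threshold.

    `alignments` is an iterable of (aligned_candidate, aligned_known)
    pairs; 30.0 mirrors the "less than 30%" cutoff mentioned in the text.
    """
    return all(percent_identity(a, b) < threshold for a, b in alignments)
```

For example, `is_unique([("ACDEFGHIKL", "AWWWWWWWWW")])` evaluates to `True` because only one of ten aligned positions matches. In practice, similarity is usually scored with substitution matrices by tools such as BLAST rather than by raw identity.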


Phase 2

The second phase of the Protein Structure Initiative (PSI-2) lasted from July 2005 to June 2010. Its goal was to use methods introduced in PSI-1 to determine a large number of protein structures and to continue streamlining the structural genomics pipeline. PSI-2 had a five-year budget of $325 million provided by NIGMS with support from the National Center for Research Resources. By the end of this phase, the Protein Structure Initiative had solved over 4,800 protein structures; over 4,100 of these were unique.

The number of sponsored research centers grew to 14 during PSI-2. Four centers were selected as Large Scale centers, with a mandate to place 15% effort on targets nominated by the broader research community, 15% on targets of biomedical relevance, and 70% on broad structural coverage; these centers were the Joint Center for Structural Genomics (JCSG), the Midwest Center for Structural Genomics (MCSG), the Northeast Structural Genomics Consortium (NESG), and the New York SGX Research Center for Structural Genomics (NYSGXRC). The new centers participating in PSI-2 included several specialized centers: the Accelerated Technologies Center for Gene to 3D Structure (ATCG3D), the Center for Eukaryotic Structural Genomics (CESG), the Center for High-Throughput Structural Biology (CHTSB; a branch of the Structural Genomics of Pathogenic Protozoa Consortium, taking that institution's place), the Center for Structures of Membrane Proteins (CSMP), and the New York Consortium on Membrane Protein Structure (NYCOMPS). Two homology modeling centers, the Joint Center for Molecular Modeling (JCMM) and New Methods for High-Resolution Comparative Modeling (NMHRCM), were also added, as well as two resource centers, the PSI Materials Repository (PSI-MR) and the PSI Structural Biology Knowledgebase (SBKB). The TB Structural Genomics Consortium was removed from the roster of supported research centers in the transition from PSI-1 to PSI-2.

Originally launched in February 2008, the SBKB is a free resource that offers protein sequence and keyword searching, as well as modules describing target selection, experimental protocols, structure models, functional annotation, metrics on overall progress, and updates on structure determination technology. Like the PDB, it is directed by Dr. Helen M. Berman and hosted at Rutgers University.

The PSI Materials Repository, established in 2006 at the Harvard Institute of Proteomics, stores and ships PSI-generated plasmid clones. Clones are sequence-verified, annotated and stored in the DNASU Plasmid Repository, located at the Biodesign Institute at Arizona State University. As of September 2011, over 50,000 PSI-generated plasmid clones and empty vectors were available for request through DNASU, in addition to over 147,000 clones generated from non-PSI sources. Plasmids are distributed to researchers worldwide. Now called the PSI:Biology Materials Repository, this resource has a five-year budget of $5.4 million and is under the direction of Dr. Joshua LaBaer, who moved to Arizona State University in the middle of 2009, taking the PSI:Biology-MR with him.


Phase 3

The third phase of the PSI was called PSI:Biology, a name intended to reflect the emphasis on the biological relevance of the work. During this phase, highly organized networks of investigators applied the paradigm of high-throughput structure determination, developed during the earlier phases of the PSI, to study a broad range of important biological and biomedical problems. The network included centers for high-throughput structure determination, centers for membrane protein structure determination, consortia for high-throughput-enabled structural biology partnerships, the SBKB and the PSI-MR. In September 2013, NIH announced that the PSI would not be renewed after its third phase ended in 2015.


Impact

As of January 2006, about two-thirds of worldwide structural genomics (SG) output came from PSI centers. Of these PSI contributions, over 20% represented new Pfam families, compared to the non-SG average of 5%. Pfam families represent structurally distinct groups of proteins as predicted from sequenced genomes. Targets homologous to proteins of known structure were avoided by using sequence comparison tools like BLAST and PSI-BLAST. Mirroring the difference in novelty seen in the discovery of new Pfam families, the PSI also discovered more SCOP folds and superfamilies than non-SG efforts. In 2006, 16% of structures solved by the PSI represented new SCOP folds and superfamilies, while the non-SG average was 4%. Solving such novel structures reflects increased coverage of protein fold space, one of the PSI's main goals. Determining the structure of a novel protein allows homology modeling to more accurately predict the fold of other proteins in the same structural family.

While most of the structures solved by the four large-scale PSI centers lack functional annotation, many of the remaining PSI centers determined structures for proteins with known biological function. The TB Structural Genomics Consortium, for example, focused exclusively on functionally characterized proteins. During its term in PSI-1, it deposited structures for over 70 unique proteins from ''Mycobacterium tuberculosis'', which represented more than 35% of total unique ''M. tuberculosis'' structures solved through 2007. In keeping with its biomedical theme of increasing coverage of phosphatomes, the NYSGXRC determined structures for about 10% of all human phosphatases.

The PSI consortia provided the overwhelming majority of targets for the Critical Assessment of Techniques for Protein Structure Prediction (CASP), a community-wide, biennial experiment to determine the state and progress of protein structure prediction.

A major goal of the PSI:Biology phase was to use the high-throughput methods developed during the initiative's first decade to generate protein structures for functional studies, broadening the PSI's biomedical impact. The phase was also expected to advance knowledge and understanding of membrane proteins.


Criticism

The PSI received notable criticism from the structural biology community. Among the charges was that the main product of the PSI (PDB files of proteins' atomic coordinates as determined by X-ray crystallography or NMR spectroscopy) is not useful enough to biologists to justify the project's $764 million cost. Critics argued that the money spent on the PSI could instead have funded what they considered worthier causes. A short response to this criticism was published.

In October 2008 the NIGMS hosted a meeting concerning the future of structural genomics efforts and invited speakers from the PSI Advisory Committee, members of the NIGMS Advisory Council, and interested scientists who had no previous involvement with the PSI. Representatives of other genomics, proteomics, and structural genomics initiatives, as well as scientists from academia, government, and industry, were also included. Based on this meeting and the subsequent recommendations from the PSI Advisory Committee, a concept-clearance document was released in January 2009 describing what a third phase of the PSI might entail. Most notable was a strong emphasis on partnerships and collaborations, to ensure that the majority of PSI research would focus on proteins of interest to the broader research community, as well as on efforts to make PSI products more accessible to that community. Grant applications for PSI:Biology were submitted by October 29, 2009. See the Phase 3 section above.


External links


* Protein Structure Initiative (PSI)
* Structural Biology Knowledgebase
* PSI:Biology-Materials Repository
* Open Protein Structure Annotation Network (TOPSAN), a wiki for annotation of protein structures determined by the PSI

